refactor scripts/finetune.py into new cli modules #550
Conversation
Just a quick look over; I didn't test it.
validate_config(cfg)
normalize_config(cfg)
Maybe this name should be `setup_config`? normalize sounds confusing.
print(tokenizer.decode(generated["sequences"].cpu().tolist()[0]))

def choose_config(path: Path):
Should this be removed? I think this causes problems on multi-GPU.
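If the interactive prompt is the culprit, one option short of removing it outright would be to refuse to prompt when running distributed; a minimal sketch, assuming a torchrun-style `WORLD_SIZE` env var:

```python
import os
from pathlib import Path


def choose_config(path: Path) -> str:
    # input() would block every rank under torchrun/accelerate, so only
    # allow interactive selection in a single-process run
    yaml_files = sorted(path.glob("*.yml"))
    if not yaml_files:
        raise FileNotFoundError(f"no config files found in {path}")
    if len(yaml_files) == 1:
        return str(yaml_files[0])
    if int(os.environ.get("WORLD_SIZE", "1")) > 1:
        raise RuntimeError("multiple configs found; pass one explicitly when multi-GPU")
    for idx, cfg_file in enumerate(yaml_files):
        print(f"[{idx}] {cfg_file}")
    return str(yaml_files[int(input("choose a config: "))])
```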
    streamer=streamer,
)
print("=" * 40)
print(tokenizer.decode(generated["sequences"].cpu().tolist()[0]))
can we skip special tokens here?
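For reference, the Hugging Face `decode` call takes a flag for exactly this:

```python
# drops BOS/EOS/pad tokens from the decoded output
print(
    tokenizer.decode(
        generated["sequences"].cpu().tolist()[0],
        skip_special_tokens=True,
    )
)
```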
LOG.info("running merge of LoRA with base model")
model = model.merge_and_unload()
model.to(dtype=torch.float16)
This should be bf16 if that's set in the config, else fp16. We can close the other PR that deals with this.
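Roughly this, assuming the config exposes a `bf16` flag for training precision (field name assumed):

```python
# pick the merge dtype from the training precision setting
# (cfg.bf16 assumed to be the existing config flag)
dtype = torch.bfloat16 if cfg.bf16 else torch.float16
model.to(dtype=dtype)
```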
    cli_args: TrainerCliArgs,
):
    model, tokenizer = load_model_and_tokenizer(cfg=cfg, cli_args=cli_args)
    safe_serialization = cfg.save_safetensors is True
This should be moved to the "normalize config" function.
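i.e. resolve it once during config normalization; a sketch of the relevant addition (the rest of `normalize_config` elided):

```python
def normalize_config(cfg):
    # coerce the optional tri-state flag to a plain bool once, so
    # downstream call sites can read cfg.save_safetensors directly
    cfg.save_safetensors = cfg.save_safetensors is True
```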
    cli_args: TrainerCliArgs,
):
    model, _ = load_model_and_tokenizer(cfg=cfg, cli_args=cli_args)
    safe_serialization = cfg.save_safetensors is True
Same here. If moved to "normalize config", this can be changed.
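With that in place, both call sites reduce to:

```python
safe_serialization = cfg.save_safetensors
```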
dataset_meta = load_datasets(cfg=parsed_cfg, cli_args=parsed_cli_args)
if parsed_cli_args.prepare_ds_only:
    return
train(cfg=parsed_cfg, cli_args=parsed_cli_args, dataset_meta=dataset_meta)
The safetensors fix is needed inside train.py as well.
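Presumably at the final save in train.py, something along these lines (exact call site assumed):

```python
# final checkpoint save inside train.py (location assumed)
model.save_pretrained(
    cfg.output_dir,
    safe_serialization=cfg.save_safetensors is True,
)
```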
)

dataset_meta = load_datasets(cfg=parsed_cfg, cli_args=parsed_cli_args)
if parsed_cli_args.prepare_ds_only:
Should this be a separate command?
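e.g. a dedicated preprocess entry point rather than a flag; a rough sketch, assuming the `load_cfg`/`load_datasets` helpers from this PR (module path and import locations hypothetical):

```python
"""Hypothetical axolotl.cli.preprocess module: tokenize/cache datasets, then exit."""
import fire

from axolotl.cli import load_cfg, load_datasets  # helpers assumed from this PR
from axolotl.common.cli import TrainerCliArgs  # import path assumed


def do_cli(config: str = "configs/", **kwargs):
    parsed_cfg = load_cfg(config, **kwargs)
    # prepare_ds_only is the field seen in the snippet above
    parsed_cli_args = TrainerCliArgs(prepare_ds_only=True)
    # prepares and caches the tokenized datasets, then exits
    load_datasets(cfg=parsed_cfg, cli_args=parsed_cli_args)


if __name__ == "__main__":
    fire.Fire(do_cli)
```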
* refactor scripts/finetune.py into new cli modules
* continue to support scripts/finetune.py
* update readme with updated cli commands
* Update scripts/finetune.py

Co-authored-by: NanoCode012 <kevinvong@rocketmail.com>